Finding outliers at multiple scales
Abstract
Outlier detection targets exceptional data whose patterns are rare and which lie in low-density regions. In this paper, under the assumption of complete spatial randomness inside clusters, we propose MDV (Multi-scale Deviation of the Volume), an approach to identifying outliers. In addition to assigning an outlier score to each object, it directly outputs a crisp outlier set. It also offers a plot showing the data structure in every object's vicinity, which is useful in explaining why an object may be outlying. Finally, the effectiveness of MDV is demonstrated on both artificial and real datasets.
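The abstract does not spell out how MDV is computed, so the following is only a minimal sketch of the general idea it points to, not the paper's algorithm. It relies on one standard fact: under complete spatial randomness (a homogeneous Poisson process), the volume of the ball reaching a point's k-th nearest neighbour has an expected value proportional to k, so a point whose neighbourhood volumes are far larger than typical at several values of k sits in a low-density region. The function names, the median-ratio score, and the cutoff are all assumptions made for illustration. The sketch also assumes the points are distinct.

```python
import math
from statistics import median


def knn_ball_volumes(points, ks):
    """Volume of the ball reaching each point's k-th nearest neighbour, per k."""
    dim = len(points[0])
    # Volume of the unit ball in `dim` dimensions.
    unit_ball = math.pi ** (dim / 2) / math.gamma(dim / 2 + 1)
    volumes = []
    for i, p in enumerate(points):
        # Sorted distances from p to every other point.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        volumes.append([unit_ball * dists[k - 1] ** dim for k in ks])
    return volumes


def multiscale_volume_scores(points, ks):
    """Illustrative score (not the paper's MDV): mean over scales of the
    point's k-NN ball volume divided by the median volume at that scale."""
    volumes = knn_ball_volumes(points, ks)
    medians = [median(v[s] for v in volumes) for s in range(len(ks))]
    return [
        sum(v[s] / medians[s] for s in range(len(ks))) / len(ks)
        for v in volumes
    ]


# A 5 x 5 unit grid stands in for a uniform cluster; (10, 10) is isolated.
points = [(float(x), float(y)) for x in range(5) for y in range(5)]
points.append((10.0, 10.0))

scores = multiscale_volume_scores(points, ks=[1, 2, 4])
cutoff = 5.0  # arbitrary cutoff, chosen only for this toy example
outliers = [i for i, s in enumerate(scores) if s > cutoff]
print(outliers)
```

Dividing by the median at each scale keeps the score robust to the outliers themselves, and averaging over several values of k is a simple stand-in for the multi-scale aspect named in the title. Turning scores into a crisp set through a fixed cutoff is the crudest possible choice and is used here only to keep the example short.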
Similar resources
Stability Analysis of a Strongly Displacement Time-Delayed Duffing Oscillator Using Multiple Scales Homotopy Perturbation Method
In the present study, some perturbation methods are applied to Duffing equations having a displacement time-delayed variable to study the stability of such systems. Two approaches are considered to analyze a Duffing oscillator having a strong delayed variable. The homotopy perturbation method is applied through the frequency analysis, and the nonlinear frequency is formulated as a function of all the ...
Finding Multiple Outliers from Multidimensional Data using Multiple Regression
Knowledge of weather is useful for identifying climate change over a period. The present framework uses 15 years of weather data for Hyderabad city, a real-time dataset collected from a weather station. Weather data is multidimensional time series data. Outliers are objects whose behavior differs from the rest. Outliers in weather data represent the cyclone, drought, seas...
Identification of outliers types in multivariate time series using genetic algorithm
Multivariate time series data are often modeled using a vector autoregressive moving average (VARMA) model. However, the presence of outliers can violate the stationarity assumption and may lead to incorrect modeling, biased estimation of parameters, and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...
Who Should be Interviewed? A Response from Cluster Analysis
Objective: This article presents an application of cluster analysis for social science research, especially studies that include interviews as part of their data collection. This application is best suited to sequential mixed-method researchers who use quantitative data to frame subsequent qualitative subsamples for conducting interviews. Methods: In more detail, the algorithm (i....
Impact of Outliers in Data Envelopment Analysis
This paper examines the relationship between "Data Envelopment Analysis" and the statistical concept of an "outlier". Data envelopment analysis (DEA) is a method for estimating the relative efficiency of decision making units (DMUs) that perform similar tasks in a production system, using multiple inputs to produce multiple outputs. An important issue in statistics is to identify the outliers. In this pap...
Journal title:
- International Journal of Information Technology and Decision Making
Volume 4
Publication date: 2005